Representing discourse coherence: A corpus-based analysis

Authors

  • Florian Wolf
  • Edward Gibson
Abstract

We present a set of discourse structure relations that are easy to code, and develop criteria for an appropriate data structure for representing these relations. Discourse structure here refers to informational relations that hold between sentences in a discourse (cf. Hobbs, 1985). We evaluated whether trees are a descriptively adequate data structure for representing coherence. Trees are widely assumed to be a suitable data structure for this purpose, but we found that more powerful data structures are needed: in the coherence structures of naturally occurring texts, we found many different kinds of crossed dependencies, as well as many nodes with multiple parents. These claims are supported by statistical results from a database of 135 texts from the Wall Street Journal and the AP Newswire that were hand-annotated with coherence relations, based on the annotation schema presented in this paper.
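The two properties named in the abstract, nodes with multiple parents and crossed dependencies, can be illustrated with a small sketch. This is not the authors' annotation tooling: it assumes each relation links two discourse segments identified by their position in the text, and the function names, relation labels, and example data are all hypothetical.

```python
from collections import Counter
from itertools import combinations

# Assumption: a relation is (source_segment, target_segment, label), where
# segments are numbered by their position in the text. The labels below are
# illustrative only and are not taken from the annotated corpus.


def multi_parent_nodes(relations):
    """Return segments that are the target of more than one relation."""
    incoming = Counter(target for _, target, _ in relations)
    return sorted(node for node, count in incoming.items() if count > 1)


def crossed_pairs(relations):
    """Return pairs of relations whose arcs cross when drawn over the text."""
    crossed = []
    for r1, r2 in combinations(relations, 2):
        lo1, hi1 = sorted(r1[:2])
        lo2, hi2 = sorted(r2[:2])
        # Two arcs cross when exactly one endpoint of one arc lies strictly
        # inside the span of the other.
        if lo1 < lo2 < hi1 < hi2 or lo2 < lo1 < hi2 < hi1:
            crossed.append((r1, r2))
    return crossed


def violates_tree_constraints(relations):
    """True if either property from the abstract is present.

    This checks only these two properties; it does not test for cycles or
    connectivity, so a False result does not prove the graph is a tree.
    """
    return bool(multi_parent_nodes(relations) or crossed_pairs(relations))


# Hypothetical four-segment text.
example = [
    (0, 1, "elaboration"),
    (0, 2, "cause-effect"),
    (1, 3, "similarity"),
    (2, 3, "contrast"),
]

print(multi_parent_nodes(example))
print(len(crossed_pairs(example)))
print(violates_tree_constraints(example))
```

In this made-up example, segment 3 is the target of two relations and the arcs 0-2 and 1-3 cross, so the structure cannot be drawn as a tree with non-crossing branches; a general graph representation accommodates both cases.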


Similar resources

Representing Discourse Coherence: A Corpus-Based Study

This article aims to present a set of discourse structure relations that are easy to code and to develop criteria for an appropriate data structure for representing these relations. Discourse structure here refers to informational relations that hold between sentences in a discourse. The set of discourse relations introduced here is based on Hobbs (1985). We present a method for annotating disc...


Discourse and Coherence: Revisiting Specific Conventions of the Centering Theory

This paper discusses a corpus-based study whose aim is to evaluate specific conventions of the centering theory and to establish whether they should be revisited. In particular, the study explores the relation between discourse coherence and several parameters such as the definition of an utterance, the varieties of anaphora considered, the forms of the discourse entities and the type of genre.


Modeling Discourse Coherence for the Automated Scoring of Spontaneous Spoken Responses

This study describes an approach for modeling the discourse coherence of spontaneous spoken responses in the context of automated assessment of non-native speech. Although the measurement of discourse coherence is typically a key metric in human scoring rubrics for assessments of spontaneous spoken language, little prior research has been done to assess a speaker’s coherence in the context of a...


The Analysis of the Discourse Markers in the Narratives Elicited from Persian-speaking Children

Discourse markers (DMs) are linguistic elements that index different relations and coherence between units of talk. Most research on the development of these forms has focused on conversations rather than narratives. This article examines age and medium effects on use of various discourse markers in pre-school children. Fifteen normal Iranian monolingual children, male and female, participated ...


Research Article Introductions: Sub-disciplinary Variations in Applied Linguistics

The present study aimed to investigate the generic organization of research article introductions in local Iranian and international journals in English for Specific Purposes, English for General Purposes, and Discourse Analysis. Overall, 120 published articles were selected from the established journals representing the above subdisciplines. Each subdiscipline was represented by 20 local and 2...




Publication date: 2004